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^ . Abstract. We show that for any monoid M, the family of languages 

' accepted by M-automata (or equivalently, generated by regular valence 

grammars over M) is completely determined by that part of M which 
lies outside the maximal ideal. In particular, every such family arises 
as a family of languages accepted by iV-automata where N is a. simple 
, or 0-simple monoid. A consequence is that every such family is either 

exactly the class of regular languages, contains all the blind one-counter 
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languages, or is the family of languages accepted by G-automata where 
^ C| G is an non-locally-finite torsion group. 

We then consider a natural extension of the usual definition which 
permits the automaton to utilise more of the structure of each monoid, 
and additionally allows us to define S-automata for S an arbitrary semi- 
group. In the monoid case, the resulting automata are equivalent to 
the valence automata with rational target sets which arise in the theory 
1 of regulated rewriting systems. We study these automata in the case 

' that the register semigroup is completely simple or completely 0-simple, 

obtaining a complete characterisation of the classes of languages cor- 
0^ ' responding to such semigroups, in terms of their maximal subgroups. 

In the process, we obtain a number of results about rational subsets of 
00 . Rees matrix semigroups which are likely to be of independent interest. 

O 

o 



- 1—1 

X 



1. Introduction 

Recently there has been increasing interest in finite automata augmented 
with a memory register which stores at any moment in time an element of a 
given monoid M (or group G). The register is initialised with the identity 
element of the monoid, modified by right multiplication by monoid elements, 
and a word is accepted if and only if some computation reading the word 
returns the register to the identity with the finite state control in an accept- 
ing state. These automata are of considerable interest both to algebraists 
(who know them as blind monoid automata or M-automata) and to com- 
puter scientists (to whom they are often extended finite automata or valence 
automata). On the one hand, they provide algebraic characterizations of im- 
portant language classes such as the context free languages, the recursively 
enumerable languages and the blind and partially blind counter languages 
[13j E3 13 EI] • On the other, they give insights into computational prob- 
lems in group and monoid theory [H [13] . They are also closely related 
to regulated rewriting systems, and in particular to the valence grammars 
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introduced by Paun [18]; the languages accepted by M-automata are exactly 
the languages generated by regular valence grammars over M [6]. 

While M-automata with M a monoid appear at first sight to provide 
much more flexibility that their group counterparts, the extent to which 
such an automaton can fully utilise the structure of the register monoid 
is somewhat limited. Indeed, if the register ever contains an element of a 
proper ideal, then no sequence of actions of the automaton can cause it to 
contain the identity again; thus, the automaton has entered a "fail" state 
from which it can never accept a word. It follows that the automaton can 
make effective use of only that part of the monoid which does not lie in a 
proper ideal. This observation will be formalised below in Section [3l where 
we show that every M-automaton is equivalent to an ^-automaton where 
N is a simple or 0-simple monoid. 

A natural way to circumvent this is to weaken the requirement that the 
identity element be the sole initial and accepting configuration of the regis- 
ter, and instead permit more general sets of initial and terminal configura- 
tions. Permitting more general terminal sets was suggested by Gilman [7], 
and the idea has also recently appeared in the study of regulated rewriting 
systems, where the introduction of valence grammars with target sets leads 
naturally to a corresponding notion of valence automata with target sets 
[5J E] . An additional advantage of allowing more general initial and accept- 
ing sets is that we remove entirely the special role played by the identity, 
and hence the very need for identity. The resulting automata are thus not 
constrained to have register monoids, but instead can use arbitrary semi- 
groups. 

If we are to retain the advantages of monoid automata as an elegant and 
easily manipulated way of describing important language classes, it is clearly 
necessary to place some kind of restriction on the class of subsets permitted 
for initial and terminal configurations. Possible choices include the finite 
subsets or the finitely generated subsemigroups, but from a computational 
perspective, the most natural choice seems to be the more general ratio- 
nal subsets of the semigroup. These sets, which have been the subject of 
intensive study by both mathematicians and computer scientists (see for 
example [TJ HSJ HHJ E2J [23] ) , are general enough to significantly increase the 
descriptive power of the automata, while remaining sufficiently well-behaved 
to permit the development of a meaningful theory. 

The objective of this paper, then, is to begin the systematic study of 
finite automata augmented with a semigroup register with rational initial 
and accepting subsets. In Section [21 we briefly recall some elementary defi- 
nitions from the theories of formal languages, monoid automata and rational 
subsets. In Section [3j we motivate the more general constructions which fol- 
low, by exhibiting a number of limitations on the capability of conventional 
M-automata to describe language classes. 

In Section H] we formally define rational semigroup automata, and obtain 
some foundational results about these automata and the classes of languages 
which they define. In Section [5] we turn our attention to Rees matrix con- 
structions. We first study the relationship between rational subsets and Rees 
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matrix constructions, obtaining a number of results which may be of inde- 
pendent interest. Some of these results are then combined with a classical 
theorem of Rees [20] to yield a complete description of the classes of lan- 
guages accepted by rational 5-automata whenever S is a completely simple 
or completely 0-simple semigroup. 

2. Preliminaries 

Firstly, we recall some basic ideas from formal language theory. Let X 
be a finite alphabet. We denote by X* the set of all words over X and 
by e the empty word. Under the operation of concatenation and with the 
neutral element e, X* forms the free monoid on X. The set X* \ {e} of 
non-empty words forms a subsemigroup of X*, called the free semigroup X + 
on X. A finite automaton over X* is a finite directed graph with each edge 
labelled with an element of S, and with a distinguished initial vertex and 
a set of distinguished terminal vertices. A word w £ X* is accepted by the 
automaton if there exists a directed path connecting the initial vertex with 
some terminal vertex labelled cumulatively with w. The set of all words 
accepted by the automaton is denoted L or for an automaton A sometimes 
L(A), and is called the language accepted by A. A language accepted by a 
finite automaton is called rational or regular. 

More generally, if S is a semigroup then a finite automaton over S is a 
finite directed graph with each edge labelled with an element of S, and with 
a distinguished initial vertex and a set of distinguished terminal vertices. An 
element s £ S is accepted by the automaton if there exists some directed 
path connecting the initial vertex to some terminal vertex, the product of 
whose edge labels is s. If S is a monoid then we admit a unique empty path 
at each vertex with label the identity element; otherwise we consider only 
non-empty paths. The subset accepted is the set of all elements accepted; a 
subset of S which is accepted by some finite automaton is called a rational 
subset. The rational subsets of S are exactly the homomorphic images in S 
of regular languages. 

We shall require the following result about rational subsets of groups, 
which is well-known and easy to prove. 

Proposition 2.1. Let G be a group. If X C G is rational then the subset 
X^ 1 = {x^ 1 | x € X} is also rational. 

Next we recall the usual definition of a monoid automaton. Let M be a 
monoid with identity 1 and let X be an alphabet. An M-automaton over X 
is a finite automaton over the direct product MxE*. We say that it accepts 
a word w £ X* if it accepts (l,w), that is if there exists a path connecting 
the initial vertex to some terminal vertex labelled (l,w). Intuitively, we 
visualize an M-automaton as a finite automaton augmented with a memory 
register which can store an element of M; the register is initialized to the 
identity element, is modified by right multiplication by element of M, and 
for a word to be accepted the element present in the memory register on 
completion must be the identity element. We write F\(M) for the class 
of all languages accepted by M-automata, or equivalently for the class of 
languages generated by regular valence grammars over M [6]. 
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3. Monoid Automata and Simple/O-Simple Semigroups 

The aim of this section is to show that the extent to which an M- 
automaton can make use of the structure of a general monoid M is severely 
limited. In particular, we formally justify and consider the consequences 
of our intuitive observation, made in the introduction, that a monoid au- 
tomaton uses only that part of the monoid which does not lie in a proper 
ideal. 

Recall that an ideal I of a semigroup (or monoid) S is a subset I of S 
with the property that S^/S* 1 C /, where S* 1 denotes S with an identity 
element adjoined. To each ideal I is associated a congruence pi on S such 
that (s, t) £ pi if and only if either s, t £ / or s = t. The quotient semigroup 
S/ pi, usually just denoted S/I, is called a Rees quotient, and takes the form 

S/i = {/} u {{x} | x e S \ I}. 

It is usual to identify {x} with x for each x £ S\I; the element /, which is 
easily seen to be a zero element in S/I, is often denoted 0. 

Proposition 3.1. Let I be a proper ideal of a monoid M. Then F\(M) = 
F 1 (M/I). 

Proof. Suppose L £ F\(M), and let A be an M-automaton accepting L. 
First notice that any path containing an edge of the form {x, w) with x £ I 
will itself have label with first component in /; in particular, since / is a 
proper ideal, 1^1 and such a path cannot be an accepting path. It follows 
that we may remove any such edges without changing the language accepted, 
and so that we may assume without loss of generality that A has no such 
edges. Now for any x\, . . . , x n £ M\I, it follows from the definition of M/I 
that xi . . . x n = 1 in M if and only if {xi} . . . {x n } = {1} in M/I. Now if 
we let B be the (M//)-automaton obtained from A by replacing edge labels 
of the form (x, w) with ({x},w), it follows easily that A has a path from 
the initial vertex to a terminal vertex labelled (l,w) if and only if B has a 
path from the initial vertex to a terminal vertex labelled ({l},w). Thus, B 
accepts the language L and L G F\(M/I). 

Conversely, if L £ F\{M /I) then L is accepted by some (M/I)-automaton; 
by an argument akin to that above, we may assume without loss of general- 
ity that B has no edges labelled by the zero element I. We now obtain from 
B a new M-automaton A by replacing edges labels of the form ({x}, w) with 
(x,w), and argue as above to show that A accepts exactly the language L, 
so that L £ F\(M). □ 

A semigroup or monoid is called simple if it does not contain any proper 
ideals; similarly a semigroup with a zero element is called ^-simple if its 
only proper ideal is {0}. 

Corollary 3.2. For every monoid M there is a simple or 0-simple monoid 
N such that Fi(M) = iq(iV). 

Proof. If M has no proper ideals then it is simple, so we are done. Otherwise, 
let / be the union of all the proper ideals of M. Then / is an ideal and, 
since the identity element 1 does not lie in any proper ideal, 1 ^ / and / 
is a proper ideal of M. Setting iV = M/I, it is easily verified that iV is 
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0-simple, and by Proposition O we have Fi(M) = Fi(M/J) = F 1 (N) as 



Corollary 13,21 tells us that the usual theory of M-automata really only 
involves the very restricted classes of simple and 0-simple monoids. 

For S a semigroup, we denote by S° the semigroup with a zero element 
adjoined, that is, with elements S U {0} where is a new symbol not in S, 
and multiplication given by 



We recall the following easy result from [2T] which says that adjoining a zero 
to a monoid M makes no difference to the class of languages accepted by 
M-automata. 

Proposition 3.3. Let M be a monoid. Then Fi(M°) = F 1 {M). 

It follows from standard results in semigroup theory (see [2J Theorem 
2.54]) that a simple [0-simple] monoid is either a group [respectively, a group 
with adjoined] or contains an embedded copy of the bicyclic monoid B. In 
[21] we observed that the languages accepted by 5-automata are exactly the 
partially blind one-counter languages, while those accepted by Z-automata 
are exactly the blind one-counter languages; both of these classes were intro- 
duced and studied by Greibach [10] . Combining this with Propositions 13.11 
and 13.31 we obtain now the following. 

Theorem 3.4. Let M be a monoid. Then either F\(M) = F\{G) for some 
group G, or F\(M) contains the partially blind one-counter languages. 

Recall that a group G is called torsion if every element has finite order, 
and locally finite if every finitely generated subgroup is finite. A straight- 
forward consequence of Theorem 13.41 is the following trichotomy. 

Theorem 3.5. Let M be a monoid. Then F\(M) either 

(i) is exactly the class of regular languages; or 

(ii) contains the class of blind one-counter languages; or 

(iii) is equal to F\ (G) for G a torsion group which is not locally finite. 

Proof. By Theorem l3.4l either F\ (M) contains the partially blind one-counter 
languages, or F\{M) = F\(G) for some group G. In the former case, 
since the class of partially blind one-counter languages contains that of 
blind one-counter languages, it is immediate that (ii) holds. So suppose 
F\(M) = F\(G) for some group G. 

If G is not a torsion group then it has an element of infinite order; this 
element generates a subgroup isomorphic to Z, from which it follows that 
F\{G) contains the class -Fi(Z) of blind one-counter languages and (ii) again 
holds. Now by \14\ Proposition 1], every language in F\(G) is in F\(H) for 
some finitely generated subgroup of H of G. If G is locally finite, then such 
an H must be finite, and so every language in F\(G) is regular. Conversely, 
F\{G) certainly contains the regular languages, so (i) holds. There remains 
only the case in which G is a torsion group which is not locally finite, in 
which case (iii) holds. □ 



required. 
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the S-product st if s, t £ S; 
otherwise. 
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Elder and Mintz [3j have observed that for G a finitely generated infinite 
torsion group, F\[G) cannot contain the blind one-counter languages, but 
does always contain non-regular languages (for example, the word problem 
of G) . It follows that the three possible conditions in Theorem 13.51 are in 
fact mutually exclusive. The theorem is of particular interest because torsion 
groups which are not locally finite are rather rare and difficult to construct. 
It would be interesting to study the language classes F\{G) corresponding 
to particular known examples of such groups j9[ HU Q2J . 

4. Rational Semigroup Automata 

In Section [3l we saw that the extent to which traditional monoid automata 
can utilise the differences in structure between groups and monoids was 
limited. In this section, we consider a generalisation which allows us to 
utilise more of the structure of arbitrary monoids, and indeed semigroups. 

Let S be a semigroup and X a finite alphabet. We define a rational S- 
automaton over X to be a finite automaton over the direct product S 1 x X* 
together with two rational subsets Xq,X\ C S called the initial set and 
terminal set respectively. The automaton accepts a word w G X* if there 
exists xq G Xq and x G S such that x^x G X\, and (x, w) labels a path from 
the initial vertex to a terminal vertex in the automaton. For S a semigroup, 
we let Ffiat(S) be the class of languages accepted by rational S'-automata. 

Intuitively, a rational S-automaton is a non-deterministic finite automa- 
ton augmented with a register which stores an element of S. The register 
is (non-deterministically) initialised with an element of Xq, and a word is 
accepted if there is a computation which reads it and leaves the finite state 
control in an accept state and the register containing a value from X\. 

Note that if S = M is a monoid then a rational M-automaton with initial 
set {1} is just an valence automaton over M with rational target set of the 
kind studied by Fernau and Stiebe [5] and the present authors [21j . Indeed, 
the following proposition says that, for M a monoid, the initial set can be 
taken to be {1} without loss of generality. 

Proposition 4.1. Let M be a monoid with identity 1, and L C X* a lan- 
guage. If L G Fji a t(M) then L is accepted by a rational M-automaton with 
initial set {1}, that is, by an valence automaton over M with rational target 
set. 

Proof. Let A be a rational monoid automaton with initial set Xq C M and 
terminal set X\ C M which accepts the language L. Let L' C M x X* be 
the full subset accepted by A interpreted as an automaton over M x X*. 
Since Xq C M is rational, the set 

X = {(x,e) | x € X } C M x X* 

is rational. Now let 

K = X'qL' = {(xqX, w) I Xq G Xq, (x, w) G L'}. 

Then w G L if and only if there exists xq G Xq and x G X such that 
(x,w) G V and xqx G X\. But this is true exactly if (x',w) G K for some 
x' G Xl 
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Now K is a product of two rational subsets, and hence is a rational subset. 
Let B be a finite automaton over M x X* recognizing K. If we interpret 
B as a rational M-automaton with initial set {1} and terminal set X±, it is 
immediate that B accepts exactly the language L. □ 

Combining Corollary 14.11 with a result of Fernau and Stiebe [5] we obtain 
the following. 

Theorem 4.2. If G is a group then Fn at {G) = F\(G). 

Proof. If L G Fnat{G) then by Corollary 14.11 L is accepted by a rational G- 
automaton with initial set {1} and some rational terminal set Xi, that is, by 
a valence automaton with rational target set. But now by [5j Theorem 3.5], 
L is accepted by a finite valence automaton, that is, a G- automaton, so that 
L is in F\(G) as required. The converse is immediate. □ 

We now turn our attention to the relationship between rational relations 
and rational semigroup automata. Let Q and X be finite alphabets, and 
consider a finite automaton over the direct product f2 + x £*; it recognizes a 
rational relation R C Q + x S*. The image of a language L C il + under the 
relation R is the set of words y G X* such that (x, y) G i? for some iGL, 

If Xo , Xi C S 1 then their difference is the set 

X^ Xi = {x e S \ xqx = x\ for some xq G Xo,xi G Xi}. 

We say that a subset X C S 1 is a rational set difference if there exist ra- 
tional subsets Xq,Xi C 5 such that X = X c j~ 1 Xi. Note that in a group, 
the rational set differences are exactly the rational subsets, but in a gen- 
eral semigroup this does not hold. The following statement is a semigroup 
analogue of |2H Proposition 3.1], which in turn generalises a well-known 
observation concerning M-automata (see for example [14|, Proposition 2]). 

Proposition 4.3. Let Xq and X\ be subsets of a semigroup S, and let 
LCS* be a language. Then the following are equivalent: 

(i) L is accepted by a S-automaton with initial set Xq and terminal set 
Xi; 

(ii) there exists a finite alphabet $7, a morphism to : J7 + — > S and a 
rational relation p C £1+ x S* such that 

l = (x^xxV-y 

If S is finitely generated then the following condition is also equivalent to 
those above. 

(iii) for every finite choice of generators lo : £l + — > S for S, there exists 
a rational relation p C fi+ x X* suc/i that 

l = (x^xiv-y 

Proof. To show that (i) implies (ii), suppose that L is accepted by an S- 
automaton A with initial set Xo and terminal set X\. Choose a finite al- 
phabet SI and a map uj : S7 + — ► S 1 such that the image £l + u> contains every 
element of S which forms the first component of an edge label in the au- 
tomaton. We now obtain from A a finite automaton B over f) + x X* by 
replacing each edge label (s, x) with (w, x) for some w G f2 + is such that 
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wuj = s. It is a routine exercise to verify that L is the image of (Xq X\)u)~ l 
under the relation accepted by B. 

Conversely, suppose we are given a map uj : £l + — > S and an automaton B 
over f2 + x X* such that L is the image under the relation accepted by B of 
the language (Xq 1 Xijuj^ 1 . We construct from B a new automaton A over 
5 x X* by applying the map uj to the first component of each edge label. Now 
interpreting A as an S'-automaton with initial set Xq and terminal rational 
set X\, it is easily seen that A accepts the lamguage L. 

Suppose now that S is finitely generated. Clearly (iii) implies (ii). Con- 
versely, if (i) holds then we can extend uj arbitrarily to a finite choice of 
generators uj' : (^') + — > S for M, and check that we still have the desired 
property, so that (iii) holds. □ 

As a corollary, we immediately obtain the following characterisation for 
language classes of the form Fnat(S). 

Proposition 4.4. Let S be a semigroup and L C X* a language. Then the 
following are equivalent. 

(i) L E F Rat (S); 

(ii) there exists an alphabet Q, a morphism uj : f2 + — > S, a rational set 
difference X C S and a rational relation p C il + x X* such that 
L = Xuj- x P . 

If S is finitely generated then the following condition is also equivalent to 
those above. 

(iii) there exists a rational set difference X C S such that for every fi- 
nite choice of generators uj : f2 + — ► S for S, there exists a rational 
relation p C il + x X* such that L = Xuj~ 1 p. 

Note that, unlike in the monoid case [21], we cannot conclude that Fn at (S) 
is a rational cone. This is because the composition of a rational relation in 
Q + x X* with a rational transduction from X* to another free monoid T* 
need not be a rational relation in f2 + x T* (although it will be rational in 

q* x r*). 

5. Rees Matrix Constructions, Completely Simple and 
Completely O-Simple Semigroups 

In this section we apply the results of the previous sections to obtain a 
description of language classes F^ t (S) for semigroups S belonging to the 
important classes of completely simple and completely 0-simple semigroups. 

Recall that an idempotent e in a semigroup is called primitive if for every 
non-zero idempotent / such that ef = fe = f we have e = /. A semigroup is 
completely simple [respectively, completely 0-simple] if it is simple [0-simple] 
and has a primitive idempotent. For more information about completely 
simple and completely 0-simple semigroups, see [T2"] . 

Now let T be a semigroup, be a new symbol not in T and let I, J be 
non-empty sets. Let P = (Pji) be a J x / matrix with entries in T U {0}. 
We define a new semigroup with set of elements 

(/ x T x J) U {0} 
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and 



(M,j)o 



0(i,t,j) = 00 = 0. 



It is simple to verify that this binary operation is associative; we call the 
semigroup constructed in this way a Rees matrix semigroup with zero over T, 
and denote it M°(T; /, J; P). The semigroup T is called the base semigroup 
and the matrix P the sandwich matrix of the construction. If P contains no 
zero entries then L x T x J forms a subsemigroup of M°(T; /, J; P), called 
a Rees matrix semigroup (without zero) over T and denoted M(T; I, J; P). 

Rees matrix semigroups play a crucial role in much of the structural theory 
of semigroups. Of particular importance is the case that the base semigroup 
T is a group G. A Rees matrix semigroup with zero over a group is called 
regular if every row and every column of the sandwich matrix contains a 
non-zero entry. The importance of this construction can be seen from the 
following seminal result of Rees [20J . 

Theorem 5.1 (The Rees Theorem). Let S = M(G; I , J; P) [respectively, 
S = M°(G; I, J; P)] be a [regular] Rees matrix semigroup over a group. 
Then S is a completely simple [respectively, completely 0-simple] semigroup. 
Conversely, every completely simple [completely 0-simple] semigroup is iso- 
morphic to one constructed in this way. 

We shall need the following proposition. 

Proposition 5.2. Let S = M(T;L,J;P) or S = M°(T;L,J;P) be a Rees 
matrix semigroup with or without zero over a semigroup T. Let X C S be a 
rational subset and suppose i £ I and j G J. Then the set 



is a rational subset ofT. 

Proof. Let A be a finite automaton over S accepting the rational subset X, 
with vertex set Q. Let J' be the set of all j € J such that A has an edge 
label with third component j; note that J' is necessarily finite. We construct 
from A a new finite automaton B over T with 

• vertex set {Q x J') U {q' } where q' Q is a new symbol; 

• start vertex q' ; 

• terminal vertices (q,j) such that q is a terminal vertex of A; 

• an edge from q' to (qi,ji) labelled t\ whenever A has an edge from 
the initial vertex to q\ labelled (i,t%,ji); 

• for every j% G J', an edge from to ((72,^2) labelled Pj 1 i 2 t2 
whenever A has an edge from q\ to q2 labelled (12^2,32) with Pj^ ^ 



Since J' is finite and A has finitely many vertices and edges, we deduce 
that B has finitely many vertices and edges. Now we show that the subset 
accepted by B is exactly Xij. Let t G X. L j. Then (i, t, j) G X labels a path in 



X l3 = {gET\(i,g,j) G X} C T 



0. 
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A from the initial vertex to some terminal vertex. Clearly this path cannot 
contain edges labelled 0, so it must have the form 

(*2,*2j2) («3,*3,i3) («m-l,(m-ljm-l) {im,tm,jm) 

PO ► Pi ► P2 ► • • • ► Pm-1 > Pm 

where po is the initial vertex of A and p m is a terminal vertex. Since the 
path is labelled (i,t,j) we must have 

(M>j) = (h,ti,ji)(i 2 ,t2,h)---(i m ,t m ,jm) 

so that i\ = i, j m = j. Now it follows easily from the construction of B that 
it has a path 

QO — > (Pl,Jl) > (P2,j2j • • • (Pm-l,Jm-l) ► (Pm,j), 

where (p m ,j) is a terminal vertex of B, so that B accepts 

t = t\Pj l i % t2Pj 2 i 3 ■ ■ ■ Pj m -ii m tm- 

Thus Xij C L(S). 

Conversely, assume that t G T is accepted by B. Then there exists a path 
through -B from the initial vertex to some terminal vertex labelled with t. 
It follows from the definition of B that this path must have the form 

QO — * (Pl,Jl) > (P2,J2J • • • (Pm-l,Jm-l) > (Pm,j), 

where p m is a terminal vertex in A, 

t = tiPjifatzPfaigts . . . Pjm^ximtm 

and A has a path 

(*>*ljl) («2,<2,i2) (*3,<3,i3) («m-l,(m-l,Jm-l) (im,tm,j) 

PO > Pi > P2 > • • • > Pm-1 * Pm 

where po is the initial vertex of A. Hence, A accepts the element 
(i,t 1 ,j 1 )(i 2 ,t 2 

= (i,t,j). 

So (i,t,j) G X and hence t G Xjj. 

So the automaton B accepts exactly the set X^, and hence Xij is a 
rational subset of T. □ 

As a corollary, we obtain a result about the intersections of rational sub- 
sets with maximal subgroups in completely simple semigroups. 

Corollary 5.3. Let H be a maximal subgroup of a completely simple or 
completely 0-simple semigroup S. Let X be a rational subset of S. Then 
X n H is a rational subset of H . 

Proof. By the Rees theorem, we may assume that 5 is a Rees matrix semi- 
group without zero M (G; I, J; P) or a regular Rees matrix semigroup with 
zero S = M°(G; I, J; P) over a group G. It follows easily from the definition 
of the Rees matrix construction that either H = {0} or 

H = {{i,g,j)\geG} 

for some % G I and j G J with Pji 7^ 0. In the former case the result is 
trivial, so we assume the latter. By Proposition 15.21 the set 

= {geG\(i,g,j)eX} = {<7 G G I (i, g, j) £ H f] X} 
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is a rational subset of G. It follows that 

PjiXij = {P jig | g G Xij} = {P jig | (i,g,j) G X} 
is also a rational subset of G. Now define a map 

4> : G^H, g^&P^gJ) 

where PJ^ is the inverse of Pji in the group C It is readily verified that <f> 
is an isomorphism from G to H, and so the image 

(PjiXij = {{i,P j : 1 g,j)\geP ji X ij } 

= {(i,P^P jig ,j) | (i,g,j) ex} 

= {(i,9,j) I (i,9,j) G X} 

= inff 

is a rational subset of G, as required. □ 

In a completely simple semigroup, where every element lies in a maximal 
subgroup, Corollary 1 5 . 3 1 easily yields the following complete characterisation 
of rational subsets. 

Theorem 5.4. The rational subsets of a completely simple semigroup are 
exactly the finite unions of rational subsets of maximal subgroups. 

Proof. Let S be a completely simple semigroup. If X%, ... ,X n are rational 
subsets of maximal subgroups of S then certainly they are rational subsets 
of S, and so is their union. Conversely, suppose X is a rational subset 
of S. It follows easily from the Rees theorem that X lies inside a finitely 
generated completely simple subsemigroup S' of S. Now S' is the union of 
finitely many maximal subgroups, so X is the union of its intersections with 
these subgroups. By Corollary 15.31 these intersections are rational, so X is 
a finite union of rational subsets of maximal subgroups of S' . But maximal 
subgroups of S' are subgroups of S, and hence lie in maximal subgroups 
of S' . It follows that X is a finite union of rational subsets of maximal 
subgroups of 5, as required. □ 

Proposition 5.5. Let S = M(T;I,J;P) or S = M°(T;I,J;P) be a Rees 
matrix semigroup with or without zero over a semigroup T, and let P' CT 
be the set of non-zero entries of the sandwich matrix P. Suppose T = P'T 
or T = TP' . Then for any i G / , j G J and rational subset X of T , the set 

{(i,t,j) \ t£X} 

is a rational subset of S. 

Proof. By symmetry of assumption, it suffices to consider the case in which 
T = P'T. Let A be a finite automaton over T accepting X, with vertex set 
Q. Let FCTbe the set of edge labels in A, and for every t G Y, let jt G J, 
it G I and St G T be such that t = Pj t i t st- Let J' = {jt \ t G Y} U {j}. Then 
J' is a finite subset of J. We define a new automaton B over S with 

• vertex set (Q x J') U {go} where qo is a new symbol; 

• initial vertex go; 

• terminal vertices (q,j) such that q is a terminal vertex of A; 
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• for every edge in A from the start vertex to a vertex q labelled t, 
and every j' G J', an edge from qo to (q,f) labelled (i,t,f); 

• for every edge in A from a vertex p to a vertex q labelled t, and every 
j' G J', an edge from (p,j t ) to (q,j') labelled (i t ,s t ,j'); 

A routine argument, akin to that in the proof of Proposition shows that 
B accepts the required subset of S. □ 

Note in particular that the conditions on the sandwich matrix in the 
hypothesis of Proposition 15.51 are satisfied in the case of a regular Rees 
matrix construction over a group. 

Recall that the rational subset problem for a finitely semigroup S is the 
algorithmic problem of deciding, given a rational subset (described as a finite 
automaton over a fixed generating set for S) and an element (described as 
a word over the same generating set), deciding whether the latter belongs 
to the former. While the phrasing of the problem is dependent on the 
precise choice of finite generating set, the decidability or undecidability of 
the problem is independent of this choice \lb\ Corollary 3.4], so one can 
meaningfully say that the abstract semigroup S has decidable or undecidable 
rational subset problem. 

Corollary 5.6. Let S = M{T;I,J;P) or S = M°(T;I,J;P) be a finitely 
generated Rees matrix semigroup with or without zero over a semigroup T. If 
T has decidable rational subset problem then S has decidable rational subset 
problem. 

Proof. We prove the statement for Rees matrix semigroups with zero. The 
result for Rees matrix constructions without zero can be obtained as an easy 
consequence, or proved directly using a similar method. 

Let lo : Q* — > T and a : X* — > S be finite choices of generators for T and 
S respectively. For every x G £ such that xa ^ 0, suppose xa = (i x ,g x ,j x ) 
and let w x G Q* be a word with w x u) = g x . For j G J and i E I such that 
Pji 7^ let Wji G fi* be a word with WjiUJ = Pj{. 

Now suppose we are given a word w = w\ . . . w n G £*, where each u>i G S, 
and a rational subset X of S. Clearly, we can test whether w represents 
and, in the case that it does, whether G X. Assume now that w does not 
represent 0. Then 

wuj = (wiuj) . . . (w n uj) = {i Wl ,g Wl Pj wl i W2 g W2 ■ ■■9w n J Wn )- 

Let Y = {t G T | (i Wl ,t, j Wn ) G X}, so that wuj G X if and only if 

( w g wi w j wl i W2 w gw2 . . . w gwn )a = g Wl Pj wl i W2 g W2 ■ ■ ■ 9w n e Y. (1) 

Now by Proposition [521 Y is rational and it follows moreover from the proof 
that we can effectively compute an automaton for Y. By assumption, we 
can solve the rational subset problem for Y, so we can decide whether ([T]) 
holds, as required. □ 

We now turn our attention to languages accepted by rational S'-automata, 
where S is a Rees matrix semigroup. We begin with a lemma which simplifies 
the case of Rees matrix semigroups with zero, by allowing us to restrict 
attention to automata for which neither the initial set nor the terminal set 
contains the zero element. 
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Lemma 5.7. Let S = M°(T; I, J; P) be a finitely generated Rees matrix 
semigroup with zero over a semigroup T . If L is accepted by a rational S- 
automaton, then L is accepted by a rational S-automaton for which neither 
the initial set nor the terminal set contain 0. 

Proof. Suppose L is accepted by a rational S-automaton A with initial set 
Xq and terminal set X\. Suppose first that £ Xq. If also G X\ then we 
have Ox € X\ for all x € S, so the language accepted is just the set of all 
words w such that (x, w) labels a path from the initial vertex to a terminal 
vertex of A for some x € S. It follows that L is regular, and hence lies 
in Fi(S). On the other hand, if ^ X\ then there is no x € S such that 
Ox S X\\ hence we may replace the initial set Xq with Xq \ {0} without 
changing the language accepted. Thus, we may assume that ^ Xq. 

Clearly we can write L = Lq U L\ where L\ is accepted by a rational 
S'-automaton with not in the initial or terminal sets, and Lq is accepted 
by a rational S'-automaton with terminal set 0. We claim that Lq is regular; 
it will follow that L is the union of L\ with a regular language, and so can 
clearly be accepted by a rational S-automaton without in the terminal set. 

Let u : f2* — > S be a finite choice of generators for S. For each i£S! such 
that xu> ^ suppose xu> = (i x ,9x,jx)- Now let K be the set of all words 
representing elements of the initial set of A, and let K' be the (necessarily 
finite) set of all final letters of words in K. It is easily seen that the language 

{d S H* | (wv)uj = for some w £ K} 

is regular. Indeed, it consists of all words which 

• contain a generator representing zero; or 

• contain consecutive generators x and y with Pj x i = 0; or 

• start with a generator y with Pj x i y = for some x € K' 

and so can be easily described by a regular expression. It now follows from 
Proposition 14.31 that Lq is a rational transduction of the above regular lan- 
guage and hence is itself regular. This completes the proof. □ 

We are now ready to prove the main theorem of this section, the essence 
of which is that rational S-automata where S is a completely simple or com- 
pletely 0-simple semigroup are no more powerful than G-automata where G 
is the maximal subgroup of S. 

Theorem 5.8. Let S be a completely simple or completely 0-simple semi- 
group with maximal non-zero subgroup G. Then 

F Rat (S) = F Rat (G) = F 1 (G). 

Proof. That F Ra t(G) = F^G) is TheoremS^ while the inclusion F Rat (G) C 
FRat(S) is immediate. Hence, we need only prove that F Rat (S) C F Rat (G). 
It follows easily from the Rees theorem that every completely simple semi- 
group S embeds in a completely 0-simple semigroup S' with the same max- 
imal non-zero subgroup, so that F Rat (S) C F Rat (S'). Hence, it suffices to 
prove the result in the case that S is completely 0-simple. 

Suppose, then, that S is completely 0-simple. By the Rees theorem, 
we may assume that S is a regular Rees matrix semigroup M°(G°; /, J; P) 
where G is a group. Suppose now that a language LCE* lies in Fjiat(S). Let 
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A be a rational S'-automaton accepting L, with initial rational set Xq C S" 
and terminal rational set X\ C 5\ By Lemma 15.71 we may assume that 
£ X and X x . 

Let C and D be automata over S accepting Xq and X\ respectively. Since 
C, D and A have only finitely many edges between them, we may choose 
finite subsets I' C I and J' C J such that the edge labels of C and D all lie 
in I' x G x J', and the edge labels of A all lie in (/' x G x J') x E*. 

For each i £ I' and j € J', we let Jfy = {g G G | (i,g,j) G -Xo}- By 
Proposition 15.21 each is a rational subset of G. It follows that 

X'v = Xa x {e} 

is a rational subset of G x S*; let Cij be an automaton accepting X[-. 

Similarly, for each i G I' and j G J' we define Yij = {g~ l G G \ (i,g,j) € 
X±}. By Propositions 15.21 and 12.14 is a rational subset of G, and so 

y/. = Fjj x {e} 

is a rational subset of G x S*; let Dij be an automaton accepting Y(-. 

Assume without loss of generality that the automaton A and all the au- 
tomata Cij and Dij have disjoint vertex sets. We construct from these 
automata a G-automaton B with 

• vertex set the union of the vertex sets of Cij and Dij (for i 6 I' and 
j G J') together with I' xQ x J' where Q is the vertex set of A, and 
a new vertex gg; 

• initial vertex q' Q ; 

• terminal vertices the terminal vertices of the automata D^; 

• all the edges of the automata Cij and D^; 

• for each i G I' and j G J', an edge from q' to the initial vertex of 
Cij labelled (1, e); 

• for each i G /' and jf G J' , an edge from each terminal vertex of Cij 
to (i,qo,j) labelled (1, e), where go is the initial vertex for A; 

• for each edge in A from p to q labelled ((i, g, j),w) and each i' G I' 
and j' G J', an edge from (i',p,j') to (i',q,j) labelled (Pj/ig,w); 

• for each i G i 7 , j G J' and terminal vertex p of ^4, an edge from 
(i,p,j) to the initial vertex of D^ labelled (1, e). 

Since J' and all the automata A, Cij and D^ are finite, it follows that 
the G-automaton B is finite. We now show that B accepts the language L. 

Let w G L. Then there exists a path through the automaton A labelled 
((i,g,j), w) connecting the initial vertex with some terminal vertex (pt say), 
such that 

for some (io,go,jo) £ Xq. Suppose this path has the form 

((*liffl iil),*x) {(i2,92,j2),X 2 ) ((«3,S3J3),^3) {(im,gm,j m ),X m ) 

qo *• qi >■ q2 *• • • • q m i > q m 

where go is the initial vertex and q m = pt is a terminal vertex of A and 
io = x\ . . . x m . Note that we must have i' = io, j' = j m and 

9 = 9l p hh92 ■ ■ ■ Pj m _ 1 i m g m - 
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Now by construction, B has a path tt2 of the form 

,. . \ (Pj h9i,xi) ,. . s (P n i 2 92,X2) . (P j2 i 3 g3,x 3 ) 

(io,qo,jo) ► (io,qi,ji) ► (io,q2,j2) ► ••• 

• • • ► (10, qm,Jmj 

Moreover, from the fact that (io,9o,3o) G Xq we see that go G -X"i j , so that 
(go ,e) € -X'igjo- Hence, (<7o>e) labels a path in Cj j from the initial vertex 
to a terminal vertex. It follows easily that (go, e) labels a path 7Ti in B from 
the initial vertex q' to (io,Qo,jo) where go- Similarly, since (i',g',j') G Xi 
we deduce that ((g 1 ')" 1 ^) G Y^j' = ii 0Jm so that B has a path 7r3 from 
(io,Qm,jm) to a terminal vertex labelled ((<7') _1 ,e). 

Composing the paths 7Ti, 7T2 and 7T3, we see that B has a path from the 
initial vertex to a terminal vertex with label 

(goPj i 1 giPj 1 i 2 g2 ■ ■ ■ Pj m ^ 1 i m g m (gT 1 ,x 1 x 2 ...x m ) 

But we know that (io,9o,jo)(h9,j) = (i' ,g' ,j'), so we must have 

goPjoh9lPjii292 ■ ■ ■ Pj m -ii m 9m = g' 

and hence 

goPj iigiPjii 2 92 ■ ■ ■ Pj m -ii m 9m(g'y l = l- 

It follows that w is accepted by the G-automaton B, as required. 

Conversely, suppose w is accepted by the G-automaton B. Then there is 
a path in B from the initial vertex to a terminal vertex labelled (l,w). We 
deduce easily from the construction of B that this path must have the form 
^i^2^3 where 

• 7Ti runs from the start vertex to some vertex (io, qo,jo) with label of 
the form (g ,e) for some g G X ioJO , so that (io,go,jo) G ^05 

• 7T2 runs from (io> ^O) Jo) to a vertex (io, <?m> jm 

) where g m is a terminal 

vertex of A; and 

• 7T3 runs from (io,9m,3m) to a terminal vertex with label ((g')~ l ,e) 
where (V) -1 G Y iojm , so that (io,g',jm) G ^l- 

Moreover, 7T2 must have the form 

(io,qo,jo) ► (io,qi,ji) ► (io,q2,j2j > • • • 

(• P J m -l.«mS"»,Xm) , 

• • • ► Oo, q m , Jm) 

where, since the label of the entire path tt is (l,w), we must have w = 
x 1 ...x m and goPj h9i ■ ■ ■ Pj m -iim9m(g')~ 1 = 1, that is, 

goPjohgi ■ ■ ■ Pj m -ii m 9m = g '■ 

We deduce from the path above and the construction of B that A has a path 

{{h,gi,jl),Xl) {(i2,g2,j2),X 2 ) {(i3,93,j3),X3) ((»m .9m jm) ,X m ) 

qo ► qi ► q2 ► • • • q m 1 > q m 

Since qq and are initial and terminal vertices of A respectively, it follows 
that A accepts (x, w) where 

X = (h,9l,3l)(i2, 92,h) ■ ■ ■ (im,9m,3m)- 
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But (io,go,jo) lies in X and 

(io,9o,3o)x = (io,9o,jo)(h,9i,ji) ■ ■ ■ (im,9m,jm) 

= (io,9oPj il9l ■ ■ ■ Pj m -ii m 9m,3m) 

= (io,g',3m) 

lies in X±, from which we deduce that the rational S'-automaton A accepts 
the word w, and so w £ L as required. □ 
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